A New Cosine Series Antialiasing Function and its Application to Aliasing-Free Glottal Source Models for Speech and Singing Synthesis

نویسندگان

  • Hideki Kawahara
  • Ken-Ichi Sakakibara
  • Masanori Morise
  • Hideki Banno
  • Tomoki Toda
  • Toshio Irino
چکیده

We formulated and implemented a procedure to generate aliasing-free excitation source signals. It uses a new antialiasing filter in the continuous time domain followed by an IIR digital filter for response equalization. We introduced a cosineseries-based general design procedure for the new antialiasing function. We applied this new procedure to implement the antialiased Fujisaki–Ljungqvist model. We also applied it to revise our previous implementation of the antialiased Fant– Liljencrants model. A combination of these signals and a lattice implementation of the time varying vocal tract model provides a reliable and flexible basis to test fo extractors and source aperiodicity analysis methods. MATLAB implementations of these antialiased excitation source models are available as part of our open source tools for speech science.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Implementations of synthesis models for speech and singing

The current implementations of the synthesis models for speech and singing are described. An improved model for speech is presented and compared to the model currently in use. A new singing synthesis model has recently been implemcn~ed in a signal-processing board. The differences between these models are pointed out. Test results from comparative measurements on synthetic speech synthesis arc ...

متن کامل

Voice source model for continuous control of pitch period.

The voiced speech waveform may be synthesized by exciting an LPC vocal tract filter with a pulse waveform patterned after naturally occurring glottal airflow pulses. Such a pulse waveform may be generated by computing samples of a piecewise polynomial curve at equally spaced time intervals. In this type of synthesis, the pitch period is commonly restricted to an integer multiple of the sample i...

متن کامل

Glottal source modeling for singing voice synthesis

Naturalness of sound quality is essential for singing-voice synthesis. Since 95% of singing is voiced sound (Cook, 1990), the focus of this paper is to improve the naturalness of the vowel tone quality via glottal excitation modeling. We propose to use the LF-model (Fant et al., 1985) for the glottal wave shape in conjunction with pitch-synchronous, amplitude-modulated Gaussian noise, which add...

متن کامل

How to precisely measure the volume velocity transfer function of physical vocal tract models by external excitation

Recently, 3D printing has been increasingly used to create physical models of the vocal tract with geometries obtained from magnetic resonance imaging. These printed models allow measuring the vocal tract transfer function, which is not reliably possible in vivo for the vocal tract of living humans. The transfer functions enable the detailed examination of the acoustic effects of specific artic...

متن کامل

Voiced Speech Synthesis Using Pitch Asynchronous Code Excited Linear Filters for the Glottal Source

This paper proposes a model for natural quality voiced speech synthesis using code excited linear all-pole filter for modeling the glottal source signal. Classical glottal signal models are explicit-time functions which inhibit joint sourcetract parameter estimation and require pitch synchronous estimation with precise segmentation of open and closed glottis phase. These problems are overcome i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017